PROCESS FOR OXIDISING AROMATIC COMPOUNDS
The invention relates to a process for enzymatically oxidising halogenated 
aromatic compounds. 

Chlorinated aromatic compounds such as the chlorobenzenes and polychlorinated
biphenyls (PCBs) are among the most widespread organic contaminants in the
environment due to their common application as solvents, biocides, and in the heavy
electrical industry. They are also some of the most problematic environmental pollutants,
not only because of the health hazards (lipid solubility and hence accumulation in fatty
tissues, toxicity and carcinogenicity) but also because of their slow degradation in the
environment.

Whilst microorganisms have shown extraordinary abilities to adapt and evolve to 
degrade most of the organic chemicals released into the environment, the most chemically 
inert compounds such as PCBs do persist for two main reasons. First, these compounds 
have very low solubility in water and therefore their bioavailability is low. Research into 
this problem has focussed on the use of detergents and other surfactants to enhance their
solubility and bioavailability. Second, these compounds require activation by enzymatic 
oxidation or reduction, and it can take a long time for the necessary genetic adaptations by 
microorganisms to occur, and even then the organisms may not be stable and viable. 

We have now found, according to the present invention, that a monooxygenase, in
particular P450cam and its physiological electron transfer partners putidaredoxin and
putidaredoxin reductase, can be used to oxidise halogenated aromatic compounds. Also,
mutants of the monooxygenase with substitutions in the active site have enhanced oxidation
activity. Thus suitable monooxygenases can be expressed in microorganisms, animals and
plants which are going to be used to oxidise the halogenated aromatic compounds.

Accordingly the present invention provides a process for oxidising a substrate
which is a halo aromatic compound, which process comprises oxidising said substrate with
a monooxygenase enzyme.

The process may be carried out in a cell that expresses:
(a) the enzyme;
(b) an electron transfer reductase; and
(c) an electron transfer redoxin.

The halo aromatic compound is typically a benzene or biphenyl compound. The
benzene ring is optionally fused and can be substituted. The halogen is typically chlorine. 
In many cases there is more than one halogen atom in the molecule, typically 2 to 5 or 6, 
for example 3. Generally 2 of the halogen atoms will be ortho or para to one another. The
compound may or may not contain an oxygen atom such as a hydroxy group, an aryloxy 
group or a carboxy group. The compound may or may not be chlorophenol or a 
chlorophenoxyacetic compound. 

Specific compounds which can be oxidised by the process of the present 
invention include 1,2-, 1,3- and 1,4-dichlorobenzene, 1,2,4-, 1,2,3- and
1,3,5-trichlorobenzene, 1,2,4,5- and 1,2,3,5-tetrachlorobenzene, pentachlorobenzene,
hexachlorobenzene, 3,3'-dichlorobiphenyl and 2,3,4,5,6- and
2,2',4,5,5'-pentachlorobiphenyl.

Other compounds which can be oxidised by the process include recalcitrant halo 
aromatic compounds, especially dioxins and halogenated dibenzofurans, and the 
corresponding compounds where one or both oxygen atoms is/are replaced by sulphur, in 
particular compounds of the formula: 




which possess at least one halo substituent, such as dioxin itself,
2,3,7,8-tetrachlorodibenzodioxin.

The oxidation typically gives rise to 1, 2 or more oxidation products. These 
oxidation products will generally comprise 1 or more hydroxyl groups. Generally, 
therefore, the oxidation products are phenols which can readily be degraded. It is 
particularly noteworthy that pentachlorobenzene and hexachlorobenzene can be oxidised in 
this way since they are very difficult to degrade. In contrast the corresponding phenols can 
be readily degraded by a variety of Pseudomonas and other bacteria. The atom which is 
oxidized is generally a ring carbon. 

The enzyme is typically a natural monooxygenase or a mutant thereof. The 
natural monooxygenase is generally a prokaryotic or eukaryotic enzyme. Typically it is a
haem-containing enzyme and/or a P450 enzyme. The monooxygenase may or may not be
a TfdA (2,4-dichlorophenoxy)acetate/α-KG dioxygenase. The monooxygenase is
generally of microorganism (e.g. bacterial), fungal, yeast, plant or animal origin, typically 
of a bacterium of the genus Pseudomonas. These organisms are typically soil, fresh water 
or salt water dwelling. In the case of a mutant monooxygenase the non-mutant form may 
or may not be able to oxidize the substrate. 

The monooxygenase typically has a coupling efficiency of at least 1%, such as at 
least 2%, 4%, 6% or more. The monooxygenase typically has a product formation rate of 
at least 5 min⁻¹, such as at least 8, 10, 15, 20, 25, 50, 100, 150 min⁻¹ or more. The coupling
efficiency or product formation rate is typically measured using any of the substrates or
conditions mentioned herein. Thus they are typically measured in the in vitro conditions
described in Example 2, in which case the relevant monooxygenase, reductase and redoxin
would be present instead of, but at the same concentration as, P450cam, putidaredoxin
reductase and putidaredoxin.
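
By way of illustration only, the two figures of merit above might be compared against the stated minimum values as in the following Python sketch. It assumes, as is common for P450 systems but is not defined in this description, that the coupling efficiency is the percentage of NADH consumed that gives rise to product, and that the product formation rate is the NADH turnover rate multiplied by the coupling efficiency. The function names and the sample figures are hypothetical.

```python
def coupling_efficiency(product_nmol, nadh_consumed_nmol):
    """Percentage of NADH consumed that led to product (assumed definition)."""
    if nadh_consumed_nmol <= 0:
        raise ValueError("NADH consumed must be positive")
    return 100.0 * product_nmol / nadh_consumed_nmol


def product_formation_rate(nadh_turnover_per_min, coupling_percent):
    """Product formation rate in min^-1 (assumed: NADH turnover x coupling)."""
    return nadh_turnover_per_min * coupling_percent / 100.0


def meets_typical_minimum(coupling_percent, rate_per_min,
                          min_coupling=1.0, min_rate=5.0):
    """Compare against the 'at least 1%' and 'at least 5 min^-1' figures."""
    return coupling_percent >= min_coupling and rate_per_min >= min_rate


# Hypothetical measurement: 20 nmol product from 400 nmol NADH,
# with an NADH turnover rate of 200 min^-1.
coupling = coupling_efficiency(20, 400)
rate = product_formation_rate(200, coupling)
print(coupling, rate, meets_typical_minimum(coupling, rate))
```

The thresholds are passed as parameters so that the higher figures mentioned above (for example 2%, 4% or 6%) can be substituted.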

The mutant typically has at least one mutation in the active site. A preferred 
mutant comprises a substitution of an amino acid in the active site by an amino acid with a 
less polar side chain. Thus the amino acid is typically substituted with an amino acid 
which is above it in Table 1. 



Table 1 HYDROPATHY SCALE FOR AMINO ACID SIDE CHAINS

Side Chain    Hydropathy

Ile            4.5
Val            4.2
Leu            3.8
Phe            2.8
Cys            2.5
Met            1.9
Ala            1.8
Gly           -0.4
Thr           -0.7
Ser           -0.8
Trp           -0.9
Tyr           -1.3
Pro           -1.6
His           -3.2
Glu           -3.5
Gln           -3.5
Asp           -3.5
Asn           -3.5
Lys           -3.9
Arg           -4.5
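
The rule that a preferred substitution moves to an amino acid higher in Table 1 can be sketched in Python as follows. This is an illustrative check only: the dictionary and function names are hypothetical, the value for Ile (4.5) is taken from the standard Kyte-Doolittle scale that the other entries follow, and residues with equal hydropathy (for example Glu and Gln) are treated as not being "above" one another.

```python
# Hydropathy values as listed in Table 1 (three-letter codes).
HYDROPATHY = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Ser": -0.8,
    "Trp": -0.9, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}


def is_less_polar_substitution(original, replacement):
    """True if the replacement side chain has a higher hydropathy value."""
    return HYDROPATHY[replacement] > HYDROPATHY[original]


# Tyr at position 96 replaced by Phe or Ala moves up the table; Phe to Tyr does not.
print(is_less_polar_substitution("Tyr", "Phe"))
print(is_less_polar_substitution("Tyr", "Ala"))
print(is_less_polar_substitution("Phe", "Tyr"))
```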



An amino acid 'in the active site' is one which lines or defines the site in which 
the substrate is bound during catalysis or one which lines or defines a site through which 
the substrate must pass before reaching the catalytic site. Therefore such an amino acid 
typically interacts with the substrate during entry to the catalytic site or during catalysis. 
Such an interaction typically occurs through an electrostatic interaction (between charged 
or polar groups), hydrophobic interaction, hydrogen bonding or van der Waals forces. 

The amino acids in the active site can be identified by routine methods to those 
skilled in the art. These methods include labelling studies in which the enzyme is allowed 
to bind a substrate which modifies ('labels') amino acids which contact the substrate.
Alternatively the crystal structure of the enzyme with bound substrate can be obtained in 
order to deduce the amino acids in the active site. 

The monooxygenase typically has 1, 2, 3, 4 or more other mutations, such as
substitutions, insertions or deletions. The other mutations may be in the active site or 
outside the active site. Typically the mutations are in the 'second sphere' residues which 
affect or contact the position or orientation of one or more of the amino acids in the active 
site. The insertion is typically at the N and/or C terminal and thus the enzyme may be part 
of a fusion protein. The deletion typically comprises the deletion of amino acids which are 
not involved in catalysis, such as those outside the active site. The monooxygenase may 
thus comprise only those amino acids which are required for oxidation activity. 

The other mutations in the active site typically alter the position and/or 
conformation of the substrate when it is bound in the active site. The mutation may make 
the site on the substrate which is to be oxidized more accessible to the haem group. Thus 
the mutation may be a substitution to an amino acid which has a smaller or larger, or more 
or less polar, side chain. 

The other mutations typically increase the stability of the protein, or make it 
easier to purify the protein. They typically prevent the dimerisation of the protein, typically 
by removing cysteine residues from the protein (e.g. by substitution of cysteine at position 
334 of P450cam, or at an equivalent position in a homologue, preferably to alanine). They
typically allow the protein to be prepared in soluble form, for example by the introduction 
of deletions or a poly-histidine tag, or by mutation of the N-terminal membrane anchoring 
sequence. The mutations typically inhibit protein oligomerisation, such as oligomerisation 
arising from contacts between hydrophobic patches on protein surfaces. 

Typically the mutant monooxygenase is at least 70% homologous to a natural
monooxygenase on the basis of amino acid identity. 

Any of the homologous proteins mentioned herein are typically at least 70% 
homologous to a protein or at least 80 or 90% and more preferably at least 95%, 97% or 
99% homologous thereto over at least 20, preferably at least 30, for instance at least 40, 60 
or 100 or more contiguous amino acids. The contiguous amino acids may include the 
active site. This homology may alternatively be measured not over contiguous amino acids 
or nucleotides but over only the amino acids in the active site. 
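
As a rough illustration of homology measured on the basis of amino acid identity, either over a whole alignment or over only the active site positions, a Python sketch is given below. It assumes that the two sequences have already been aligned (for example by one of the alignment programs) into strings of equal length with '-' for gaps; the function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, positions=None):
    """Percentage of compared alignment columns holding identical residues.

    positions: optional iterable of 0-based column indices (e.g. the
    active site columns); when omitted, every column is compared.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = range(len(aligned_a)) if positions is None else positions
    compared = 0
    matches = 0
    for i in columns:
        a, b = aligned_a[i], aligned_b[i]
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no columns to compare")
    return 100.0 * matches / compared


# Toy example: 9 of 10 columns identical overall.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
# Restricted to three chosen columns, one of which differs.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV", positions=[0, 1, 9]))
```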

The monooxygenase is preferably:

(i) P450cam,

(ii) a naturally occurring homologue of (i),

(iii) a mutant of (i) or (ii).

Typically (i) is any allelic variant of P450cam of Pseudomonas putida (e.g. of the
polypeptide sequence shown in SEQ ID No. 1). Typically (ii) is a species homologue of (i)
which has sequence homology with (i), and is typically P450BM-3 of Bacillus megaterium
(e.g. the polypeptide sequence shown in SEQ ID No. 2), P450terp of Pseudomonas sp.,
P450eryF of Saccharopolyspora erythraea, or P450 105D1 (CYP105) of Streptomyces
griseus strains.

The active site of (ii) or (iii) may be substantially the same as the active site of (i) 
or any of the mutants of (i) mentioned herein. Thus the site may comprise the same amino 
acids in substantially the same positions. 

Typically in (iii) amino acid 96 of P450cam, or the equivalent amino acid in a
homologue, has been changed to an amino acid with a less polar side chain. 

The 'equivalent' side chain in the homologue is one at the homologous position. 
This can be deduced by lining up the P450cam sequence and the sequence of the homologue
based on the homology between the two sequences. The PILEUP, BLAST and BESTFIT
algorithms can be used to line up the sequences (for example as described in Altschul, S. F.
(1993) J Mol Evol 36:290-300; Altschul, S. F. et al (1990) J Mol Biol 215:403-10 and
Devereux et al (1984) Nucleic Acids Research 12, p387-395). These algorithms can also
be used to calculate the levels of homology discussed herein (for example on their default
settings). The equivalent amino acid will generally be in a similar place in the active site
of the homologue as amino acid 96 in P450cam.

The discussion below provides examples of the positions at which substitutions 
may be made in P450cam. The same substitutions may be made at equivalent positions in
the homologues. Standard nomenclature is used to denote the mutations. The letter of the
amino acid present in the natural form is followed by the position, followed by the amino
acid in the mutant. To denote multiple mutations in the same protein each mutation is
listed separated by hyphens. The mutations discussed below using this nomenclature
specify the natural amino acid in P450cam, but it is to be understood that the mutation could
be made to a homologue which has a different amino acid at the equivalent position. Note
that the amino acid numbering shown in SEQ ID No. 1 for P450cam does not correspond to
the numbering used in the description to denote mutations. The numbering in SEQ ID No.
1 is one more than the numbering in the description for a particular position.
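
The nomenclature and the numbering offset just described can be illustrated by the following Python sketch, which splits a label such as Y96A-F87W into its individual substitutions and converts a description position into the corresponding SEQ ID No. 1 position. The function names are hypothetical and the sketch handles only single-letter substitution labels of the kind used in this description.

```python
import re

# One substitution: natural residue letter, position, mutant residue letter.
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutations(label):
    """Split a hyphen-separated label into (natural, position, mutant) tuples."""
    parsed = []
    for part in label.split("-"):
        match = _SUBSTITUTION.match(part.strip())
        if match is None:
            raise ValueError(f"not a substitution label: {part!r}")
        parsed.append((match.group(1), int(match.group(2)), match.group(3)))
    return parsed


def to_seq_id_numbering(description_position):
    """SEQ ID No. 1 numbering is one more than the description numbering."""
    return description_position + 1


for natural, position, mutant in parse_mutations("Y96A-F87W"):
    print(natural, position, mutant, "SEQ ID No. 1 position:",
          to_seq_id_numbering(position))
```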

An additional mutation is typically an amino acid substitution at amino acid 87,
98, 101, 185, 244, 247, 248, 296, 395, 396 or a combination of these, for example as 
shown in table 2. 

The following combinations of substitutions are preferred: 

(i) Substitution at position 87 to amino acids of different side-chain volume, 
such as substitutions (typically of F) to A, L, I and W, combined with substitutions at 
position 96 to amino acids of different side-chain volume such as (typically Y to) A, L, F, 
and W. These combinations alter the space available in the upper part of the substrate 
pocket compared to the wild-type enzyme, for example, from Y96W-F87W (little space) to 
Y96A-F87A (more space), as well as the location of the space, for example from one side 
in Y96F-F87A to the other in Y96A-F87W. 

(ii) Substitution at position 96 to F combined with substitutions at positions 
185 and 395. Both T185 and I395 are at the upper part of the substrate pocket, and
substitution with A creates more space while substitution with F will reduce the space 
available and push the substrate close to the haem. 

(iii) Substitutions at position 96 to A, L, F, and W combined with substitutions 
at residues closer to the haem including at 101, 244, 247, 295, 296 and 396 to A, L, F, or 
W. These combinations will create or reduce space in the region of the different side- 
chains to offer different binding orientations to substrates of different sizes. For example, 
the combinations Y96W-L244A and Y96L-V247W will offer very different pockets for the 
binding of the substrate. 

(iv) Triple substitutions at combinations of positions 87, 96, 244, 247, 295, 
296, 395 and 396 with combinations of A, L, F, and W. The aim is to vary the size and 
shape of the hydrophobic substrate binding pocket. For example, the Y96A-F87A-L244A 
combination creates more space compared to the Y96F-F87W-V396L combination, thus 
allowing larger substrates to bind to the former while restricting the available binding 
orientations of smaller substrates in the latter. The combinations Y96F-F87W-V247L and 
Y96F-F87W-V295I have comparable substrate pocket volumes, but the locations of the 
space available for substrate binding are very different. The combination Y96F-F87L- 
V247A has a slightly larger side-chain volume at the 96 position than the combination 
Y96L-F87L-V247A, but the L side-chain at the 96 position is much more flexible and the 
substrate binding orientations will be different for the two triple mutants. 

(v) The mutants with four or five substitutions were designed with similar
principles of manipulating the substrate volume, the different flexibility of various side- 
chains, and the location of the space available in the substrate pocket for substrate binding 
so as to effect changes in selectivity of substrate oxidation. 
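
The way in which such combinations multiply can be illustrated with a short Python sketch that enumerates the double mutants of combination (i), using the substituent amino acids listed there for positions 96 and 87. The function name is hypothetical, and the sketch only generates labels in the nomenclature of this description; it makes no statement about which combinations were actually prepared.

```python
from itertools import product


def double_mutants_96_87(choices_96=("A", "L", "F", "W"),
                         choices_87=("A", "L", "I", "W")):
    """Labels for every pairing of a Y96 substitution with an F87 substitution."""
    return [f"Y96{at_96}-F87{at_87}"
            for at_96, at_87 in product(choices_96, choices_87)]


labels = double_mutants_96_87()
print(len(labels))
print(labels[:4])
```

Four choices at each of two positions already give sixteen candidates, which is one reason the triple and higher mutants are described by design principle rather than listed exhaustively.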

Mutations are generally introduced into the enzyme by using methods known in 
the art, such as site directed mutagenesis of the enzyme, PCR and gene shuffling methods 
or by the use of multiple mutagenic oligonucleotides in cycles of site-directed mutagenesis. 
Thus the mutations may be introduced in a directed or random manner. Typically the 
mutagenesis method produces one or more polynucleotides encoding one or more different 
mutants. In one embodiment a library of mutant oligonucleotides is produced which can 
be used to produce a library of mutant enzymes. 

The process is typically carried out in the presence of the natural cofactors of the 
monooxygenase. Thus typically in addition to the enzyme (a) and the substrate the process 
is carried out in the presence of an electron transfer reductase (b), an electron transfer 
redoxin (c), cofactor for the enzyme and an oxygen donor. In this system the flow of 
electrons is generally: cofactor → (b) → (c) → (a).

(b) is generally an electron transfer reductase which is able to mediate the 
transfer of electrons from the cofactor to (c), such as a naturally occurring reductase or a 
protein which has homology with a naturally occurring reductase, such as at least 70% 
homology; or a fragment of the reductase or homologue. (b) is typically a reductase of any 
of the organisms mentioned herein, and is typically a flavin dependent reductase, such as 
putidaredoxin reductase. 

(c) is generally an electron transfer redoxin which is able to mediate the transfer 
of electrons from the cofactor to (a) via (b). (c) is typically a naturally occurring electron 
transfer redoxin or a protein which has homology with a naturally occurring electron 
transfer redoxin, such as at least 70% homology; or a fragment of the redoxin or 
homologue. (c) is typically a redoxin of any of the organisms mentioned herein. (c) is
typically a two-iron/two sulphur redoxin, such as putidaredoxin. 

The cofactor is any compound capable of donating an electron to (b), such as 
NADH. The oxygen donor is any compound capable of donating oxygen to (a), such as 
dioxygen. 

Typically (a), (b) and (c) are present as separate proteins; however they may be 
present in the same fusion protein. Typically only two of them, preferably (b) and (c), are
present in the fusion protein. Typically these components are contiguous in the fusion 
protein and there is no linker peptide present. 

Alternatively a linker may be present between the components. The linker 
generally comprises amino acids that do not have bulky side chains and therefore do not 
obstruct the folding of the protein subunits. Preferably the amino acids in the linker are 
uncharged. Preferred amino acids in the linker are glycine, serine, alanine or threonine. In 
one embodiment the linker comprises the sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C. 
The linker is typically at least 5 amino acids long, such as at least 10, 30 or 50 or
more amino acids long. 

In the process the concentration of (a), (b) or (c) is typically from 10⁻⁸ to
10⁻² M, preferably from 10⁻⁶ to 10^M. Typically the ratio of concentrations of (a): (b)
and/or (a): (c) is from 0.1:01 to 1:10, preferably from 1:0.5 to 1:2, or from 1:0.8 to 1:1.2. 
Generally the process is carried out at a temperature and/or pH at which the enzyme is 
functional, such as when the enzyme has at least 20%, 50%, 80% or more of peak activity. 
Typically the pH is from 3 to 11, such as 5 to 9 or 6 to 8, preferably 7 to 7.8 or 7.4.
Typically the temperature is 10 to 90°C, such as 25 to 75°C or 30 to 60°C. 

In the process different monooxygenases may be present. Typically each of 
these will be able to oxidise different substrates, and thus using a mixture of 
monooxygenases will enable a wider range of substrates to be oxidised. 

In one embodiment the process is carried out in the presence of a substance able 
to remove hydrogen peroxide by-product (e.g. a catalase). 

In one embodiment the process is carried out in the presence of the enzyme, 
substrate and an oxygen atom donor, such as hydrogen peroxide or t-butylhydroperoxide, 
for example using the peroxide shunt. 

In one embodiment in the process the (a), (b) and (c) together are typically in a 
substantially isolated form and/or a substantially purified form, in which case together they 
will generally comprise at least 90%, e.g., at least 95%, 98% or 99% of the protein in the 
preparation. 

The process may be carried out inside or outside a cell. The cell is typically in 
culture, at a locus, in vivo or in planta (these aspects are discussed below). 

The process is typically carried out at a locus such as in land (e.g. in soil) or in 
water (e.g. fresh water or sea water). When it is carried out in culture the culture typically
comprises different types of cells of the invention, for example expressing different
monooxygenases of the invention. Generally such cells are cultured in the presence of
assimilable carbon and nitrogen sources.

Typically the cell in which the process is carried out is one in which the 
monooxygenase does not naturally occur. In another embodiment the monooxygenase is 
expressed in a cell in which it does naturally occur, but at higher levels than naturally 
occurring levels. The cell may produce 1, 2, 3, 4 or more different monooxygenases of the 
invention. These monooxygenases may be capable of oxidising different halo aromatic
compounds. Typically the cell also expresses any of the reductases and/or redoxins 
discussed above. 

The cell is typically produced by introducing into a cell (i.e. transforming the cell 
with) a vector comprising a polynucleotide that encodes the monooxygenase. The vector 
may integrate into the genome of the cell or remain extrachromosomal. The cell may 
develop into the animal or plant discussed below. Typically the coding sequence of the 
polynucleotide is operably linked to a control sequence which is capable of providing for 
the expression of the coding sequence by the host cell. The control sequence is generally a 
promoter, typically of the cell in which the monooxygenase is expressed.

The term "operably linked" refers to a juxtaposition wherein the components 
described are in a relationship permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence is ligated in such a way that 
expression of the coding sequence is achieved under conditions compatible with the control 
sequences. 

The vector is typically a transposon, plasmid, virus or phage vector. It typically 
comprises an origin of replication. It typically comprises one or more selectable marker 
genes, for example an ampicillin resistance gene in the case of a bacterial plasmid. The 
vector is typically introduced into host cells using conventional techniques including 
calcium phosphate precipitation, DEAE-dextran transfection, or electroporation. 

Components (b) and (c) may be expressed in the cell in a similar manner. 
Typically (a), (b) and (c) are expressed from the same vector, or may be expressed from 
different vectors. They may be expressed as three different polypeptides. Alternatively 
they may be expressed in the form of fusion proteins. The cell typically expresses more 
than one type of monooxygenase.

In one embodiment the three genes encoding the three proteins of the P450cam
system, i.e. camA, camB, and camC are placed in the mobile regions of standard
transposon vectors and incorporated into the genome of Pseudomonas and flavobacteria.
Alternatively plasmid vectors for expressing these genes may be used, in which case the
P450cam gene cluster will be extra-chromosomal.

The cell may be prokaryotic or eukaryotic and is generally any of the cells or of 
any of the organisms mentioned herein. Preferred cells are Pseudomonas, flavobacteria or
fungi cells (e.g. Aspergillus). In one embodiment the cell is one which in its naturally 
occurring form is able to oxidise any of the substrates mentioned herein. Typically the cell 
is in a substantially isolated form and/or substantially purified form, in which case it will 
generally comprise at least 90%, e.g. at least 95%, 98% or 99% of the cells or dry mass of 
the preparation. 

The invention provides a transgenic animal or plant whose cells are any of the 
cells of the invention. The animal or plant is transgenic for the monooxygenase gene and 
typically also an appropriate electron transfer reductase and/or redoxin gene. They may be 
homozygote or heterozygote for such genes, which are typically transiently introduced into 
the cells, or stably integrated (e.g. in the genome). The animal is typically a worm (e.g.
earthworm) or nematode. The plant or animal may be obtained by transforming an
appropriate cell (e.g. embryo stem cell, callus or germ cell), fertilising the cell if required,
allowing the cell to develop into the animal or plant and breeding the animal or plant true if
required. The animal or plant may be obtained by sexual or asexual (e.g. cloning)
propagation of an animal or plant of the invention or of the F1 organism (or any generation
removed from the F1, or the chimera that develops from the transformed cell).

As discussed above the process may be carried out at a locus. Thus the invention 
also provides a method of treating a locus contaminated with a halo aromatic compound 
comprising contacting the locus with a monooxygenase, cell, animal or plant of the 
invention. These organisms are then typically allowed to oxidise the halo aromatic 
compound. In one embodiment the organisms used to treat the locus are native to the 
locus. Thus they may be obtained from the locus (e.g. after contamination), 
transformed/transfected (as discussed above) to express the monooxygenase (and 
optionally an appropriate electron transfer reductase and/or redoxin).

In one embodiment the locus is treated with more than one type of organism of
the invention, e.g. with 2, 3, 4, or more types which express different monooxygenases 
which oxidise different halo aromatic compounds. In one embodiment such a collection of 
organisms between them is able to oxidise all halobenzenes, e.g. all chlorobenzenes. 

The organisms (e.g. in the form of the collection) may carry out the process of 
the invention in a bioreactor (e.g. in which they are present in immobilised form). Thus the 
water or soil to be treated may be passed through such a bioreactor. Soil may be washed 
with water augmented with surfactants or ethanol and then introduced into the bioreactor. 

The invention also provides a process for selecting a mutant of a monooxygenase 
for its ability to oxidise any of the substrates mentioned herein, which process comprises 
screening a library of said mutants for their oxidation effect on the substrate. Thus 
typically the substrate is provided to the library and mutants are selected based on their 
ability to oxidise the substrate, for example at a particular rate or under particular 
conditions. The mutant may be selected based on its ability to oxidise the substrate to a 
particular oxidation product. 

Typically the library will be in the form of cells which comprise the mutant 
enzymes. Generally each cell will express only one particular mutant enzyme. The library 
typically comprises at least 500 mutants, such as at least 1,000 or 5,000 mutants, preferably 
at least 10,000 different mutants. 

The library typically comprises a random population of mutants. The library 
may undergo one or more rounds of selection whilst being produced and therefore may not 
comprise a random population. 

The library is typically produced by contacting any of the cells discussed herein 
which expresses the monooxygenase with a mutagen and/or when the cell is a mutator cell 
culturing the cell in conditions in which mutants are produced. The mutagen may be 
contacted with the cell prior to or during culturing of the cell. Thus the mutagen may be 
present during replication of the cell or replication of the genome of the cell. 

The mutagen generally causes random mutations in the polynucleotide sequence 
which encodes (a). The mutagen is typically a chemical mutagen, such as 
nitrosomethylguanidine, methyl- or ethylmethane sulphonic acid, nitrite, hydroxylamine,
DNA base analogues, and acridine dyes, such as proflavin. Alternatively it is electromagnetic
radiation, such as ultra-violet radiation at 260 nm (absorption maximum of DNA) and
X-rays. It may also be ionising radiation.

A mutator cell is generally deficient in one or more of the primary DNA repair 
pathways (such as the E. coli pathways mutS, mutD or mutT, or their equivalents in another
organism), and thus has a high mutation rate. Simply culturing such a cell leads to the DNA
encoding (a) becoming mutated. The cell may be of the E. coli XL1 Red mutator strain.

The mutant selected from the library may be used in any aspect of the invention, 
thus it may be used to oxidise a substrate in the process of the invention or may be 
expressed in the cell, animal or plant of the invention. It may be used in the method of 
treating a locus. 

The invention is also illustrated by the Examples:

Example 1 

Expression of mutants for in vitro work. 

The P450cam enzymes were expressed using the vector pRH1091 (Baldwin, J.E.,
Blackburn, J.M., Heath, R.J., and Sutherland, J.D. Bioorg. Med. Chem. Letts., 1992, 2,
663-668) which utilised the trc promoter (a fusion of the trp and lac promoters). This
vector incorporates a strong ribosome binding site (RBS) and the gene to be expressed is
cloned using an Nde I site on the 5' end of the gene. We used Hind III as the cloning site at
the 3' end of the camC gene. The procedure for protein expression is as follows: Cells are
grown at 30°C until the OD600 reaches 1.0 - 1.2, the temperature is increased to 37°C
and camphor added as a 1 M stock in ethanol to a final concentration of 1 mM. The culture
is allowed to incubate at 37°C for another 6 hours. The P450cam protein is expressed to high
levels in the cytoplasm and the cells take on a red to orange-red colour.

We have also prepared a variant of pRH1091 (by PCR) which has an extra Xba I site
between the RBS and the Nde I site. This is important because Nde I is not unique in M13,
and this restriction site is also present in the reductase gene as well as the backbone of the
pGLW11 vector used for the in vivo system. Xba I is unique in the polylinker region of
M13, but absent in the genes of all three proteins in the P450cam system and in the
expression vectors. It therefore allows the camC gene to be moved between the mutagenic
and expression vectors.

How the mutants were made.

Oligonucleotide-directed site-specific mutagenesis was carried out by the Kunkel
method (Kunkel, T. A. Proc. Natl. Acad. Sci. USA 1985, 82, 488-492) using the Bio-Rad
Mutagen kit. The recommended procedure is summarised as follows. An M13 mp19
subclone of the camC gene encoding P450cam was propagated in the E. coli strain CJ236.
This strain has the dut- ung- phenotype and thus will tolerate the inclusion of uracil in place
of thymine in DNA molecules. After three cycles of infection, uracil-containing single
stranded (USS) M13 DNA was readily isolated by phenol extraction of mature M13 phage
particles excreted into the growth medium. The mutagenic oligonucleotide (or
oligonucleotides) were phosphorylated with T4 polynucleotide kinase and then annealed to
the USS template. The four nucleotides, DNA polymerase, DNA ligase, ATP and other
chemical components were added and the second strand was synthesised in vitro. The
double stranded form thus obtained was transformed into the dut+ ung+ E. coli strain
MV1190, which should degrade the uracil-containing template strand and propagate the
mutant strand synthesised in vitro. Plaques were picked and phages of possible mutants
grown in E. coli strains MV1190 or TG1. The single-stranded DNA from these was
sequenced to determine whether the mutagenesis reaction was successful. The mutagenic
efficiency was 50 - 80%.

The mutant camC gene is excised from the M13 subclone by restriction digest with
Nde I and Hind III, and the fragment of appropriate size is ligated to the backbone of the
expression vector prepared by a similar Nde I/Hind III digest.

Multiple mutants were prepared either by further mutagenesis, also by the Kunkel
method, or, where the location of the sites in the sequence permits, by simple cloning steps.
There are two unique restriction sites within the camC gene which are absent from the
expression vector. One is Sph I which spans residues 121 - 123, and the other is Sal I which
spans residues 338 and 339. Therefore, all mutations at, for example, residues 87, 96, 98,
and 101 are readily combined with mutations at higher number residues by ligating
appropriate fragments from restriction digests of mutant camC genes with Nde I/Sph I and
Sph I/Hind III and the backbone fragment from a Nde I/Sph I digest of the expression
vector. Mutations at, for example, 395 and 396 can be similarly incorporated by digests in
which Sph I is replaced with Sal I.
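
The fragment-exchange logic described above can be sketched in a few lines. This is only an illustration: the residue ranges for Sph I and Sal I are those quoted in the text, while the function names and the naming of a middle "Sph I/Sal I" fragment are assumptions introduced here.

```python
# Restriction-site positions within camC (residue numbers quoted in the text).
SPH_I = (121, 123)
SAL_I = (338, 339)


def fragment_for(residue: int) -> str:
    """Return the (hypothetically named) camC fragment carrying a residue."""
    if residue < SPH_I[0]:
        return "Nde I/Sph I"
    if SPH_I[1] < residue < SAL_I[0]:
        return "Sph I/Sal I"
    if residue > SAL_I[1]:
        return "Sal I/Hind III"
    raise ValueError(f"residue {residue} lies within a restriction site")


def can_combine_by_cloning(res_a: int, res_b: int) -> bool:
    """Mutations on different fragments can be combined by fragment exchange;
    mutations on the same fragment need a further round of mutagenesis."""
    return fragment_for(res_a) != fragment_for(res_b)


print(fragment_for(96), "|", fragment_for(247), "|", fragment_for(396))
print(can_combine_by_cloning(96, 247))  # sites separated by Sph I
print(can_combine_by_cloning(87, 96))   # both upstream of Sph I
```

Under these assumptions, a Y96 mutation and a V247 mutation can be joined by a simple ligation, whereas F87 and Y96 mutations cannot.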

The rationale for introducing the unique Xba I site is now clear: many mutants with
multiple mutations were prepared by the cloning procedure above. Without the Xba I site it
would be impossible to clone the gene for these multiple mutants from the expression
vector back into M13 for further rounds of mutagenesis. Of course these problems could be
overcome by doing mutagenesis by PCR, for example.

Example 2

Substrate oxidation protocol: in vitro reactions 



Component                   Final concentration

P450 cam enzyme             1 µM
Putidaredoxin               10 µM
Putidaredoxin reductase     1 µM
Bovine liver catalase       20 µg/ml
KCl                         200 mM
Substrate                   Typically 1 mM
NADH                        250 - 400 µM



* 50 mM Tris-HCl buffer pH 7.4 is added to make up the volume. 

* Temperature controlled at 30°C, optional.

* The NADH turnover rate could be determined by monitoring the absorbance at 
340 nm with time. 

* Catalase does not catalyse the substrate oxidation reactions; rather, it is present
to remove any hydrogen peroxide by-product which could otherwise denature the P450 cam.
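
As an illustration of how the NADH turnover rate might be derived from the absorbance trace at 340 nm, the sketch below applies the Beer-Lambert law. The molar absorption coefficient of NADH at 340 nm (6.22 mM^-1 cm^-1) is the standard literature value; the function name, the 1 cm path length and the example reading are assumptions made here, not values taken from the protocol.

```python
# Molar absorption coefficient of NADH at 340 nm (standard literature value).
EPSILON_NADH_340 = 6.22  # mM^-1 cm^-1


def nadh_turnover_rate(delta_a340_per_min, p450_uM, path_length_cm=1.0):
    """Return nmol NADH consumed per nmol P450 per minute.

    delta_a340_per_min: rate of absorbance change at 340 nm (per minute);
                        the sign is ignored because NADH is being consumed.
    p450_uM:            P450 concentration in the cuvette, in micromolar.
    path_length_cm:     optical path length (assumed default: 1 cm).
    """
    nadh_mM_per_min = abs(delta_a340_per_min) / (EPSILON_NADH_340 * path_length_cm)
    nadh_uM_per_min = nadh_mM_per_min * 1000.0
    return nadh_uM_per_min / p450_uM


# Made-up example reading: A340 falls by 0.0622 per minute with 1 uM enzyme.
rate = nadh_turnover_rate(-0.0622, p450_uM=1.0)
print(f"{rate:.1f} nmol NADH (nmol P450)^-1 min^-1")
```

Because the rate is normalised to the enzyme concentration, the result is directly comparable between incubations run with different amounts of P450 cam.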

The method can be increased in scale to, for example, 20 ml total incubation 
volume to allow purification of sufficient products by HPLC for spectroscopic 
characterisation. Fresh substrate (1 mM) and NADH (1-2 mM) are added periodically, 
such as every 20 minutes in a total reaction time of, typically, 3 hours. 

Example 3

The in vivo system 

The in vivo systems were expressed using the vector pGLW11, a derivative of the
plasmid pKK223 (Brosius, J. and Holy, A. Proc. Natl. Acad. Sci. USA, 1984, 81,
6929-6933). Expression is directed by the tac promoter and the vector incorporates a gene
conferring resistance to the antibiotic ampicillin.

Two systems were constructed. The first one expressed the electron transfer
proteins putidaredoxin reductase (camA gene) and putidaredoxin (camB gene) as a fusion
protein with a seven amino acid peptide linker, and the P450 cam enzyme (camC gene) was
expressed by the same vector but it was not fused to the electron transfer proteins. The second
system expressed the three proteins as separate entities in the E. coli host. Both systems
were catalytically competent for substrate oxidation in vivo.

The general strategy was as follows. The genes for the three proteins were cloned
using Eco RI and Hind III as flanking sites, with Eco RI at the 5' end. For both in vivo
systems there are restriction sites between the genes, including between the reductase and
redoxin genes in the fusion construct. These restriction sites were introduced by PCR, as
detailed below. The first task, however, was to carry out a silent mutation to remove the
Hind III site within the camA gene for the reductase. The AAGCTT Hind III recognition
sequence in the camA gene was changed to AAGCCT, which is a silent mutation because
GCT and GCC both encode alanine. The gene was completely sequenced to ensure that
there were no spurious mutations.
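
The silent nature of this change can be checked against the standard genetic code. The sketch below is illustrative only; it assumes, as the text implies, that the GCT codon occupies bases 3 to 5 of the AAGCTT site, and the variable names are introduced here.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

HIND_III_SITE = "AAGCTT"
original = "AAGCTT"  # Hind III recognition sequence inside camA
mutated = "AAGCCT"   # sequence after the single base change

# Assumed reading frame: the alanine codon is bases 3-5 of the six-base site.
codon_before = original[2:5]
codon_after = mutated[2:5]

print(codon_before, "->", CODON_TABLE[codon_before])
print(codon_after, "->", CODON_TABLE[codon_after])
print("Hind III site destroyed:", HIND_III_SITE not in mutated)
```

Both codons translate to alanine (A), so the protein sequence is unchanged while the recognition site is lost.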



1. The fusion protein system

1.a Manipulation of the camA gene by PCR

For the camA gene the primer below was used at the 5' end of the gene to introduce 
the Eco RI cloning site and to change the first codon from GTG to the strong start codon 
ATG. 

5'- GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC GCA AAC
              Eco RI                RBS                 |- camA

At the 3' end of camA the primer was designed such that 15 bases are
complementary to the nucleotide sequence of the last five amino acid residues of camA. The
stop codon immediately after the GCC codon for the last amino acid was removed, and
then part of a seven amino acid linker (Thr Asp Gly Gly Ser Ser Ser) which contained a
Bam HI cloning site (GGATCC = Gly Ser) was introduced. The coding sequence was
thus:

5'- GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3'
                 camA -| Thr Asp Gly Gly Ser
                                   |Bam HI|

The primer sequence shown below is the reverse complement used for PCR: 
5'- CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3' 
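
The relationship between the coding sequence and the PCR primer can be verified with a short reverse-complement routine. This is a minimal sketch; the helper name is introduced here for illustration, and the two sequences are copied from the text.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, ignoring spaces."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


coding = "GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG"
primer = "CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC"

matches = reverse_complement(coding) == primer.replace(" ", "")
print("primer is the reverse complement of the coding sequence:", matches)

# Bam HI (GGATCC) is palindromic, so the site reads the same on both strands.
print("Bam HI site present in primer:", "GGATCC" in primer.replace(" ", ""))
```

The same routine can be applied to the other coding-strand and primer pairs given in this Example.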

1.b Manipulations of the camB gene by PCR

For the camB gene the primer at the 5' end incorporated the second half of the 
peptide linker between the reductase and redoxin proteins, and the restriction site Bam HI 
for joining the two amplified genes together. 

5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT-3'
            Gly Ser Ser Ser |- camB
           |Bam HI|         Start

At the 3' end of camB the primer incorporates 12 nucleotides complementary to the
end of camB followed by the stop codon TAA and a 6 nucleotide spacer before the GGAG
ribosome binding site. Xba I and Hind III sites were then added to allow cloning of the
camC gene when required. The sequence of the coding strand was therefore:

5'- CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT TCA TCG-3'
                 camB -| stop        RBS  Xba I           Hind III

The primer shown below is the reverse complement used for PCR: 

5'- CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA TTG CCT ATC GGG-3'

1.c Preparation of the full fusion construct

The camA and camB genes were amplified by PCR using the primers described
above. The new camA was digested with Eco RI and Bam HI, while the new camB was
digested with Bam HI and Hind III. The pGLW11 expression vector was digested with
Eco RI and Hind III. All three were purified by agarose gel electrophoresis and the three
gel slices containing the separate fragments were excised from the gel and ligated together,
and then transformed into E. coli DH5α. Successful ligation of all the fragments was
confirmed by a series of restriction digestion experiments, especially the presence of the
new and unique Xba I site. The entire sequence of the insert from the Eco RI site to the
Hind III site was determined to ensure that all the sequences were correct.

The new plasmid, named pSGBF, was transformed into E. coli and expression of the
reductase and redoxin proteins was induced by IPTG. When a purified P450 cam enzyme
was added to the cell-free extract, substrate oxidation was observed for a variety of
substrates.

When the camC gene is cloned into the pSGBF plasmid using the Xba I and Hind
III restriction sites, the new recombinant plasmid thus generated expresses the reductase
and redoxin as a fusion protein and the P450 cam enzyme as a separate entity, both from the
same mRNA molecule. This in vivo system is catalytically competent for terpene
oxidation in whole cells.



2. The in vivo system with the proteins expressed separately

2.a The basic strategy 

The starting point of the preparation of this in vivo system was the recombinant
plasmid used to express the camA gene for putidaredoxin reductase. The camA gene was
cloned into the pGLW11 plasmid using the Eco RI and Bam HI restriction sites, with Eco
RI being at the 5' end of the gene. Conveniently the polylinker region of the pGLW11
vector has a Hind III site downstream of the Bam HI site. The camB gene was therefore
manipulated by PCR such that it can be cloned into pGLW11 using the Bam HI and Hind
III sites. This new plasmid expresses the reductase and redoxin as separate proteins.

The camB gene was cloned into pUC118 by the Bam HI and Hind III cloning sites
to express putidaredoxin for our general in vitro substrate oxidation work. Therefore, the
PCR primer at the 3' end of the camB gene was designed to introduce a ribosome binding
site and the Xba I restriction site upstream of the Hind III site so that the camC gene can be
inserted downstream of camB using the Xba I and Hind III sites. Therefore the three genes
were cloned without fusion in the pGLW11 expression vector and arranged in the order
5'-camA-camB-camC-3', and each gene has its own RBS to initiate protein synthesis.

2.b Manipulations of the camB gene

We used the internal and unique restriction site Mlu I (recognition sequence 
ACGCGT) within the camB gene as the starting point so that the PCR product has a 
different size from the PCR template fragment. The primers were as follows: 

5'- TCA TCG ACG CGT CGC GAA CTG CTG-3' 

where ACG CGT is the Mlu I site.

The desired coding sequence at the 3' end of the camB gene was:

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC
                 camB -| stop

    TAT GCG CGA GTG GAG TCT AGA GTT CGA-3'
                  RBS   Xba I

After the stop codon there is a 35 base spacer before the RBS which is used to 
initiate the synthesis of the P450 cam enzyme. The Xba I cloning site is located within the 
spacer between the RBS and the start codon (not in this primer) of the camC gene. The
PCR primer used was the reverse complement of the sequence above. The PCR was 
carried out and the amplified fragment of the appropriate size was purified by agarose gel 
electrophoresis and the gel slice excised. 
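
The layout of the desired 3' coding sequence (stop codon, 35 base spacer, RBS, Xba I site) can be checked programmatically. The sketch below is illustrative: the sequence is copied from the text, the RBS is assumed to be the GGAG motif used in the fusion construct, and the variable names are introduced here.

```python
# Desired coding sequence at the 3' end of camB, as given in the text.
seq = (
    "CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC "
    "TAT GCG CGA GTG GAG TCT AGA GTT CGA"
).replace(" ", "")

STOP_CODON = "TAA"
RBS = "GGAG"       # assumed ribosome binding site motif
XBA_I = "TCTAGA"   # Xba I recognition sequence

# The first five codons belong to camB, so the stop codon starts at base 15.
stop_start = 5 * 3
assert seq[stop_start:stop_start + 3] == STOP_CODON

spacer_start = stop_start + 3
rbs_start = seq.index(RBS, spacer_start)
spacer_length = rbs_start - spacer_start
xba_start = seq.index(XBA_I, rbs_start)

print("spacer length between stop codon and RBS:", spacer_length)
print("Xba I site begins", xba_start - rbs_start - len(RBS), "bases after the RBS")
```

Under these assumptions the spacer between the stop codon and the GGAG motif is 35 bases long, in agreement with the description, and the Xba I site follows the RBS directly.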

One extra step was necessary to complete the construction of the new plasmid. The
plasmid for the fusion protein in vivo system was digested with Mlu I and Hind III
restriction enzymes, purified by agarose gel electrophoresis, and the gel slice for the small
camB fragment excised. The pUC118 plasmid for camB expression was similarly digested,
and the gel slice for the backbone was excised. By ligating the two fragments together we
prepared a new pUC118-based plasmid which had an Xba I site followed by a Hind III
site downstream of the stop codon of camB. This new plasmid was digested with the Mlu I
and Xba I enzymes and the backbone was ligated with the new camB fragment described
above to generate a plasmid with the following arrangement of the key components:

..lac promoter..Bam HI..camB gene..spacer..RBS..Xba I..Hind III..

2.c Preparation of the in vivo system plasmid

Once the modified camB with the Xba I and Hind III restriction sites and
appropriate spacers was prepared, the in vivo system was constructed by cloning this into
the pGLW11-based plasmid used to express the camA gene (reductase protein) using the
Bam HI and Hind III sites. The new in vivo system vector has the following arrangement
of the key components:

..tac promoter..Eco RI..RBS..camA gene..spacer..Bam HI..RBS..camB
gene..spacer..RBS..Xba I..Hind III..

This new plasmid, named pSGB+, was transformed into E. coli and expression of
the reductase and redoxin proteins was induced by IPTG. When a purified P450 cam enzyme
was added to the cell-free extract, substrate oxidation was observed for a variety of
substrates.

When the camC gene is cloned into this pSGB+ plasmid using the Xba I and Hind
III restriction sites, the new recombinant plasmid thus generated will express the three
proteins separately, each under the direction of its own RBS but from the same mRNA
molecule. This constitutes the in vivo system used in the vast majority of our terpene
oxidation work.

3. Introduction of an Xba I site into pRH1091

This is the final step to enable the camC gene to be cloned into the in vivo systems
by the two cloning sites Xba I and Hind III. The Xba I site was added by PCR of the entire
pRH1091 plasmid using two primers. The presence of these two sites will also enable
cloning of the camC gene into M13 since both Xba I and Hind III are unique in camC and
M13.

The primer shown below maintains the Hind III cloning site AAGCTT:



5'- TCA TCG AAG CTT GGC TGT TTT-3'
            Hind III |- vector

At the other end the coding sequence desired was:

5'- ACA ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT TCA TCG-3'
    Vector -|       RBS  Xba I     Nde I           Hind III

This sequence maintained the Nde I and Hind III sites but the new Xba I site was
introduced upstream of the Nde I site. The PCR primer used was the reverse complement
of the desired sequence:

5'- CGA TGA AAG CTT CGA TGA CAT ATG G TCT AGA TCCT GTG TGA AAT TGT-3'

The PCR product was then purified by agarose gel electrophoresis, digested with
Hind III and circularised with T4 DNA ligase. Success of the PCR method was indicated
by the presence of a new and unique Xba I site in plasmid DNA isolated from
transformants.

4. Cloning of camC into the in vivo systems

All existing camC mutants were cut out of pRH1091-based expression plasmids with
Nde I and Hind III. The new vector is similarly cut with the same restriction enzymes and
the camC gene cloned into this plasmid with T4 DNA ligase. This DNA is transformed
into E. coli JM109 which then may be grown to express P450 cam.

The camC gene is excised from the new vector using Xba I and Hind III restriction
enzymes and cloned into either the in vivo vector systems or M13mp19 for mutagenesis.

5. In vivo expression and substrate turnover

For protein expression, cells are grown in LBamp medium (tryptone 10 g/litre,
yeast extract 5 g/litre, NaCl 10 g/litre, 50 µg/ml ampicillin) at 30°C until the OD600nm
reaches 1.0-1.2. IPTG (isopropyl-β-D-thiogalactopyranoside) was added to a final
concentration of 1 µM (from a 1 M stock in H2O) and the culture was incubated at 30°C
overnight.

For simple screening the substrate can be added to culture and the incubation
continued. However, due to impurities from the culture media the cells were generally
washed twice with 0.5 vol. of buffer P (KH2PO4 6.4 g, K2HPO4.3H2O 25.8 g, H2O to 4
litres, pH 7.4) and resuspended in 0.25 vol. oxygen saturated buffer P containing 24 mM
glucose. Substrate was added to 1 mM and the incubation continued at 30°C. The reaction
was allowed to run for 24 hours with periodic additions of substrate and glucose.
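
For reference, the phosphate concentration of buffer P can be worked out from the recipe above. The masses and final volume are those given in the text; the molar masses are standard values, and the function name is introduced here for illustration.

```python
# Molar masses in g/mol (standard values).
MW_KH2PO4 = 136.09
MW_K2HPO4_3H2O = 228.22

FINAL_VOLUME_L = 4.0  # "H2O to 4 litres"


def millimolar(mass_g: float, molar_mass: float, volume_l: float) -> float:
    """Convert a mass dissolved in a given volume to a millimolar concentration."""
    return mass_g / molar_mass / volume_l * 1000.0


kh2po4_mM = millimolar(6.4, MW_KH2PO4, FINAL_VOLUME_L)
k2hpo4_mM = millimolar(25.8, MW_K2HPO4_3H2O, FINAL_VOLUME_L)
total_phosphate_mM = kh2po4_mM + k2hpo4_mM

print(f"KH2PO4:          {kh2po4_mM:.1f} mM")
print(f"K2HPO4.3H2O:     {k2hpo4_mM:.1f} mM")
print(f"total phosphate: {total_phosphate_mM:.1f} mM")
```

On these figures buffer P is approximately a 40 mM potassium phosphate buffer.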

Example 4 

The oxidation of halo aromatic compounds 



Mutant             2,3,6-             3,4,6-             Coupling      Product
                   Trichlorophenol    Trichlorophenol    efficiency    formation
                                                         (%)           rate (min^-1)

Y96F               75                 25                 18            22
Y96A               77                 23                 14            33
Y96H               54                 46                 3             1
F87L-Y96F          42                 58                 4             8
F87A-Y96F          52                 48                 2             3
F87A-Y96F-V247A    43                 57                 4             7

Mutant             2,3-               3,4-               Coupling      Product
                   Dichlorophenol     Dichlorophenol     efficiency    formation
                                                         (%)           rate (min^-1)

Y96A               94                 6                  6             19
Y96F               91                 9                  4             8
Y96A-V247L         94                 6                  7             20
Y96L-V247A         90                 10                 2             0.7
F87L-Y96F          96                 4                  3             5
C334A              95                 5                  2             0.5

All mutants have C334A. Coupling efficiency is the percentage of NADH
consumed which was utilised for product formation, i.e. a percentage of the theoretical
maximum efficiency. The product formation rates are given in (nmol product) (nmol
P450 cam)^-1 (min)^-1. The relative amount of product formed in each case is shown.
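
Since coupling efficiency is defined as the fraction of NADH consumed that ends up in product, the NADH turnover rate implied by a table entry follows by simple division. The sketch below illustrates this with the Y96F entry of the first table (product formation rate 22 min^-1, coupling 18%); the function name is an assumption introduced here.

```python
def nadh_rate_from_coupling(product_rate: float, coupling_percent: float) -> float:
    """Return the NADH turnover rate implied by a product formation rate.

    coupling (%) = 100 * product formed / NADH consumed, hence
    NADH rate    = product rate * 100 / coupling (%).
    """
    if not 0 < coupling_percent <= 100:
        raise ValueError("coupling efficiency must be in the range (0, 100]")
    return product_rate * 100.0 / coupling_percent


# Y96F entry of the first table: 22 min^-1 product formation at 18% coupling.
y96f_nadh_rate = nadh_rate_from_coupling(22, 18)
print(f"Y96F: about {y96f_nadh_rate:.0f} nmol NADH (nmol P450)^-1 min^-1")
```

For Y96F this gives roughly 122 min^-1, showing that most of the NADH consumed by this mutant is not channelled into product.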

1,3- and 1,4-dichlorobenzene, 1,2,3- and 1,3,5-trichlorobenzene, 1,2,4,5- and 1,2,3,5-
tetrachlorobenzene, and 2,3,4,5,6- and 2,2',4,5,5'-pentachlorobiphenyl were also
found to be oxidised.

Wild-type and mutant P450 cam enzymes were tested for their ability to oxidise
3,3'-dichlorobiphenyl and 2,2',4,5,5'-pentachlorobiphenyl. Results are shown in terms of NADH
turnover. Rates are given as nanomol NADH consumed per nanomol P450 cam enzyme per
minute.



P450 cam enzyme           3,3'-dichlorobiphenyl    2,2',4,5,5'-pentachlorobiphenyl

Wild-type                 0.4                      not detected
Y96F                      15                       1
F87A-Y96F                 845                      165
F87L-Y96F                 174                      13
F87W-Y96F                 4                        3
F87A-Y96F-V247A           112                      12
Y96A-V247L                84                       37
F87A-Y96F-L244A           669                      321
F87A-Y96F-L244A-V247A     173                      214

The first product, 4-hydroxy-3,3'-dichlorobiphenyl, was identified by the
characteristic coupling patterns expected in the 1H NMR spectrum and by mass
spectroscopy. The further oxidation product, 4,4'-dihydroxy-3,3'-dichlorobiphenyl, was
identified by co-elution with an authentic sample, and by UV-vis and mass spectroscopy.
This product did not constitute more than ca. 10% of the total products in any of the
mutants tested.

For the second substrate the product was established as
4'-hydroxy-2,2',4,5,5'-pentachlorobiphenyl by the observation of the parent ion in the mass
spectrum, and by comparison with literature 1H NMR data.



WO 00/78973 



PCT/GB00/02379 




P450 cam mutants

All mutants optionally contain the base mutation C334A.
Single mutants: Y96A, Y96F, Y96L, Y96W.



Double mutants:

Y96A-F87A     Y96F-F87A     Y96F-V295A    Y96L-F87A     Y96L-A296L
Y96A-F87L     Y96F-F87I     Y96F-V295L    Y96L-F87L     Y96L-A296F
Y96A-F87W     Y96F-F87L     Y96F-V295I    Y96L-F98W     Y96L-V396A
Y96A-F98W     Y96F-F87W     Y96F-A296L    Y96L-T101L    Y96L-V396L
Y96A-L244A    Y96F-F98W     Y96F-A296F    Y96L-T101F    Y96L-V396F
Y96A-V247A    Y96F-T101L    Y96F-I395F    Y96L-L244A    Y96L-V396W
Y96A-V247L    Y96F-T101F    Y96F-I395G    Y96L-L244F
Y96A-I395F    Y96F-T185A    Y96F-V396A    Y96L-V247A
Y96A-I395G    Y96F-T185F    Y96F-V396L    Y96L-V247L    Y96W-F87W
              Y96F-T185L    Y96F-V396F    Y96L-V247F    Y96W-F98W
              Y96F-L244A    Y96F-V396W    Y96L-V247W    Y96W-L244A
              Y96F-V247A                  Y96L-G248L    Y96W-V247A
              Y96F-V247L                  Y96L-V295L    Y96W-V396A
              Y96F-G248L                  Y96L-V295F

Triple mutants:

Y96A-F87A-L244A
Y96A-F87A-V247A
Y96A-F87L-L244A
Y96A-F87L-V247A
Y96A-L244A-V247A
Y96L-F87A-L244A
Y96L-F87A-V247A
Y96L-F87L-L244A
Y96L-F87L-V247A
Y96L-V247A-I395F
Y96L-V247L-I395F
Y96L-V247L-I395G
Y96L-L244A-V396L
Y96L-L244A-V396F
Y96L-L244A-V396W
Y96L-L244F-V396A
Y96L-V247A-V396L
Y96L-V247A-V396F
Y96L-V247A-V396W
Y96L-V247F-V396A
Y96F-F87A-L244A
Y96F-F87A-V247A
Y96F-F87A-V247L
Y96F-F87A-I395F
Y96F-F87A-I395G
Y96F-F87L-V247A
Y96F-F87L-V247L
Y96F-F87L-I395F
Y96F-F87W-T185A
Y96F-F87W-T185F
Y96F-F87W-T185L
Y96F-F87W-L244F
Y96F-F87W-V247A
Y96F-F87W-V247L
Y96F-F87W-V247F
Y96F-F87W-V295L
Y96F-F87W-A296L
Y96F-F87W-V396A
Y96F-F87W-V396L
Y96F-V247F-V396A
Y96F-L244A-V396L
Y96F-L244A-V396F
Y96F-L244A-V396W
Y96F-L244F-V396A
Y96F-V247A-V396L
Y96F-V247A-V396F
Y96F-V247A-V396W
Y96W-F87W-F98W
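
The mutant labels above follow the pattern wild-type residue, position, replacement residue, joined by hyphens. A label can be parsed and checked against the wild-type sequence as sketched below. The wild-type residues in the lookup table were read from SEQ ID NO: 1 of the sequence listing; the function and variable names are assumptions introduced for illustration.

```python
import re

# Wild-type residues of P450 cam at the positions used in the mutant lists,
# read from SEQ ID NO: 1 (one-letter amino acid codes).
WILD_TYPE = {
    87: "F", 96: "Y", 98: "F", 101: "T", 185: "T", 244: "L", 247: "V",
    248: "G", 295: "V", 296: "A", 334: "C", 395: "I", 396: "V",
}

SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutant(label: str):
    """Split a label such as 'Y96F-F87W-V396A' into (wild type, position,
    replacement) tuples sorted by position, checking each wild-type residue."""
    substitutions = []
    for part in label.split("-"):
        match = SUBSTITUTION.match(part)
        if match is None:
            raise ValueError(f"cannot parse substitution {part!r}")
        wild, position, new = match.group(1), int(match.group(2)), match.group(3)
        if WILD_TYPE.get(position) != wild:
            raise ValueError(f"{part}: unexpected wild-type residue at {position}")
        substitutions.append((wild, position, new))
    return sorted(substitutions, key=lambda s: s[1])


print(parse_mutant("Y96F-F87W-V396A"))
```

A check of this kind flags labels whose wild-type residue does not agree with the sequence listing, which is a common sign of a transcription error.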

CLAIMS

1. Process for oxidising a substrate which is a halo aromatic compound, which
process comprises oxidising said substrate with a monooxygenase enzyme.

2. Process according to claim 1 in which the enzyme comprises a substitution
of an amino acid in the active site by an amino acid with a less polar side-chain.

3. Process according to claim 2 in which the enzyme comprises one or more
other amino acid substitutions in the active site.

4. Process according to any one of the preceding claims in which the enzyme
is
(i) P450 cam, or
(ii) a naturally occurring homologue of (i), or
(iii) a mutant of (i) or (ii).

5. Process according to claim 4 in which the enzyme is one in which amino
acid 96 of P450 cam, or the equivalent amino acid in a homologue, has been changed to an
amino acid with a less polar side-chain.

6. Process according to any one of the preceding claims in which the halogen
is chlorine.

7. Process according to any one of the preceding claims in which the aromatic
compound is a benzene or biphenyl.

8. Process according to any one of the preceding claims in which the substrate
has more than one halogen atom.

9. Process according to claim 8 in which the substrate is 1,2-dichlorobenzene,
1,2,4-trichlorobenzene, 3,3'-dichlorobiphenyl or 2,2',4,5,5'-pentachlorobiphenyl.

10. Process according to claim 8 in which the substrate is
pentachlorobenzene or hexachlorobenzene.

11. Process according to any one of the preceding claims which is carried out in
a cell that expresses:
(a) an enzyme as defined in any one of claims 1 to 5;
(b) an electron transfer reductase; and
(c) an electron transfer redoxin.

12. Process according to claim 11 in which:
(b) is putidaredoxin reductase or a homologue; or a fragment thereof; and/or
(c) is putidaredoxin or a homologue; or a fragment thereof.

13. Process according to claim 11 or 12 wherein the cell is one in which the
enzyme (a) does not naturally occur.

14. Process according to any one of claims 11 to 13 wherein the cell is
one which in its naturally occurring form is able to oxidise a substrate as defined in any
one of claims 6 to 10.

15. A cell as defined in claim 14.

16. A transgenic animal or plant whose cells are as defined in any one of
claims 11 to 14.

17. Method of treating a locus contaminated with a substrate as defined
in any one of claims 1 or 6 to 10 comprising contacting the locus with an enzyme as
defined in any one of claims 1 to 5 or a cell as defined in any of claims 11 to 13, or an
animal or plant as defined in claim 16.

18. Process for selecting a mutant of an enzyme as defined in claim 1, 4(i) or
4(ii) for its ability to oxidise a substrate as defined in claim 1, or any one of the claims 6 to
10, which process comprises screening a library of said mutants for their oxidation effect
on the substrate.

19. Process, cell, animal, plant or method according to any one of claims 1 to 17
wherein the enzyme is one that has been selected in a process according to claim 18.

SEQUENCE LISTING

<110> Isis Innovation Limited
<120> Process for oxidising aromatic compounds
<130> N76277A PEJ
<170> PatentIn Ver. 2.1
<210> 1
<211> 1242
<212> DNA
<213> Pseudomonas putida
<220>
<221> CDS
<222> (1)..(1242)
<400> 1

acg act gaa acc ata caa agc aac gcc aat ctt gcc cct ctg cca ccc 48
Thr Thr Glu Thr Ile Gln Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro
1 5 10 15

cat gtg cca gag cac ctg gta ttc gac ttc gac atg tac aat ccg tcg 96
His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser
20 25 30

aat ctg tct gcc ggc gtg cag gag gcc tgg gca gtt ctg caa gaa tca 144
Asn Leu Ser Ala Gly Val Gln Glu Ala Trp Ala Val Leu Gln Glu Ser
35 40 45

aac gta ccg gat ctg gtg tgg act cgc tgc aac ggc gga cac tgg atc 192
Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp Ile
50 55 60

gcc act cgc ggc caa ctg atc cgt gag gcc tat gaa gat tac cgc cac 240
Ala Thr Arg Gly Gln Leu Ile Arg Glu Ala Tyr Glu Asp Tyr Arg His
65 70 75 80

ttt tcc agc gag tgc ccg ttc atc cct cgt gaa gcc ggc gaa gcc tac 288
Phe Ser Ser Glu Cys Pro Phe Ile Pro Arg Glu Ala Gly Glu Ala Tyr
85 90 95

gac ttc att ccc acc tcg atg gat ccg ccc gag cag cgc cag ttt cgt 336
Asp Phe Ile Pro Thr Ser Met Asp Pro Pro Glu Gln Arg Gln Phe Arg
100 105 110

gcg ctg gcc aac caa gtg gtt ggc atg ccg gtg gtg gat aag ctg gag 384
Ala Leu Ala Asn Gln Val Val Gly Met Pro Val Val Asp Lys Leu Glu
115 120 125

aac cgg atc cag gag ctg gcc tgc tcg ctg atc gag agc ctg cgc ccg 432
Asn Arg Ile Gln Glu Leu Ala Cys Ser Leu Ile Glu Ser Leu Arg Pro
130 135 140

caa gga cag tgc aac ttc acc gag gac tac gcc gaa ccc ttc ccg ata 480
Gln Gly Gln Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro Ile
145 150 155 160

cgc atc ttc atg ctg ctc gca ggt cta ccg gaa gaa gat atc ccg cac 528
Arg Ile Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp Ile Pro His
165 170 175

ttg aaa tac cta acg gat cag atg acc cgt ccg gat ggc agc atg acc 576
Leu Lys Tyr Leu Thr Asp Gln Met Thr Arg Pro Asp Gly Ser Met Thr
180 185 190

ttc gca gag gcc aag gag gcg ctc tac gac tat ctg ata ccg atc atc 624
Phe Ala Glu Ala Lys Glu Ala Leu Tyr Asp Tyr Leu Ile Pro Ile Ile
195 200 205

gag caa cgc agg cag aag ccg gga acc gac gct atc agc atc gtt gcc 672
Glu Gln Arg Arg Gln Lys Pro Gly Thr Asp Ala Ile Ser Ile Val Ala
210 215 220

aac ggc cag gtc aat ggg cga ccg atc acc agt gac gaa gcc aag agg 720
Asn Gly Gln Val Asn Gly Arg Pro Ile Thr Ser Asp Glu Ala Lys Arg
225 230 235 240

atg tgt ggc ctg tta ctg gtc ggc ggc ctg gat acg gtg gtc aat ttc 768
Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn Phe
245 250 255

ctc agc ttc agc atg gag ttc ctg gcc aaa agc ccg gag cat cgc cag 816
Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg Gln
260 265 270

gag ctg atc gag cgt ccc gag cgt att cca gcc gct tgc gag gaa cta 864
Glu Leu Ile Glu Arg Pro Glu Arg Ile Pro Ala Ala Cys Glu Glu Leu
275 280 285

ctc cgg cgc ttc tcg ctg gtt gcc gat ggc cgc atc ctc acc tcc gat 912
Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg Ile Leu Thr Ser Asp
290 295 300

tac gag ttt cat ggc gtg caa ctg aag aaa ggt gac cag atc ctg cta 960
Tyr Glu Phe His Gly Val Gln Leu Lys Lys Gly Asp Gln Ile Leu Leu
305 310 315 320

ccg cag atg ctg tct ggc ctg gat gag cgc gaa aac gcc tgc ccg atg 1008
Pro Gln Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro Met
325 330 335

cac gtc gac ttc agt cgc caa aag gtt tca cac acc acc ttt ggc cac 1056
His Val Asp Phe Ser Arg Gln Lys Val Ser His Thr Thr Phe Gly His
340 345 350

ggc agc cat ctg tgc ctt ggc cag cac ctg gcc cgc cgg gaa atc atc 1104
Gly Ser His Leu Cys Leu Gly Gln His Leu Ala Arg Arg Glu Ile Ile
355 360 365

gtc acc ctc aag gaa tgg ctg acc agg att cct gac ttc tcc att gcc 1152
Val Thr Leu Lys Glu Trp Leu Thr Arg Ile Pro Asp Phe Ser Ile Ala
370 375 380

ccg ggt gcc cag att cag cac aag agc ggc atc gtc agc ggc gtg cag 1200
Pro Gly Ala Gln Ile Gln His Lys Ser Gly Ile Val Ser Gly Val Gln
385 390 395 400

gca ctc cct ctg gtc tgg gat ccg gcg act acc aaa gcg gta 1242
Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val
405 410

<210> 2
<211> 3150
<212> DNA
<213> Bacillus megaterium
<220>
<221> CDS
<222> (1)..(3150)
<400> 2

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 48
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15

aat tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa 96
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30

att gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt 144
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45

gta acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat 192
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60

gaa tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80

gat ttt gca gga gac ggg tta ttt aca agc tgg acg cat gaa aaa aat 288
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95

tgg aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca 336
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110

atg aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt 384
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125

caa aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa 432
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160

tat cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca 528
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190

aat cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa 624
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205

gat atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc 672
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220

aaa gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac 720
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 768
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270

ctt tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285

caa aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gct cca 912
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Ala Pro
290 295 300

agc tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320

gaa gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca 1008
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
325 330 335

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350

gaa cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 1104
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400

tgt atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt 1248
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415

atg atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 1296
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 1344
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445

gca aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act 1392
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460

gaa cag tct gcc aaa aaa gca cgc aaa aag gca gaa aac gct cat aat 1440
Glu Gln Ser Ala Lys Lys Ala Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480

acg ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga 1488
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495

acg gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg 1536
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510




cag gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga 1584
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525

gct gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 1632
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540

gca aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta 1680
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560

aaa ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct 1728
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575

act acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct 1776
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590

aaa ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac 1824
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872 
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

gta gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa 1920
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640

tct act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt 1968
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655

gcg aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa 2016
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670

ctt caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa 2064
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685

ctt cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att 2112
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700



cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720

cta gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta 2208
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735

gct cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa 2256
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765

gct gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg 2352
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780

ctt gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc 2448
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 

gaa ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att 2496
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830

tct tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc 2544
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845

gtt gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att 2592
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860

gcg tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc 2640
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880

ttt att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa 2688
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895

acg ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 2736
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910

ggc ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt 2784
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925

gga gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat 2832
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940

ctg tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg 2880
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960

ctt cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2928
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990

caa gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct 3024
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005

gcc gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg 3072
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
1010 1015 1020

agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa ggc 3120
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly
1025 1030 1035 1040

cga tac gca aaa gac gtg tgg gct ggg taa 3150
Arg Tyr Ala Lys Asp Val Trp Ala Gly 

1045 1050 
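
The reading frame at the end of the listing can be checked with a short sketch. This is an illustration only and not part of the application: the names `CODON_TABLE`, `translate`, `LAST_ROW` and `TOTAL_NUCLEOTIDES` are hypothetical, the table holds only the standard-code codons that occur in the final row, and the eighth codon is assumed to read gct, consistent with the Ala printed beneath it.

```python
# Minimal sketch (assumption: standard genetic code, partial table covering
# only the codons found in the final row of the listing).
CODON_TABLE = {
    "cga": "Arg", "tac": "Tyr", "gca": "Ala", "aaa": "Lys", "gac": "Asp",
    "gtg": "Val", "tgg": "Trp", "gct": "Ala", "ggg": "Gly", "taa": "Stop",
}


def translate(row):
    """Translate space-separated codons, stopping at the first stop codon."""
    residues = []
    for codon in row.split():
        amino_acid = CODON_TABLE[codon.lower()]
        if amino_acid == "Stop":
            break
        residues.append(amino_acid)
    return residues


LAST_ROW = "cga tac gca aaa gac gtg tgg gct ggg taa"
TOTAL_NUCLEOTIDES = 3150  # running count printed at the end of the listing

residues = translate(LAST_ROW)
total_codons = TOTAL_NUCLEOTIDES // 3

print(" ".join(residues))   # Arg Tyr Ala Lys Asp Val Trp Ala Gly
print(total_codons - 1)     # 1049 residues, excluding the stop codon
```

Under these assumptions the 3150 nucleotides form 1050 codons, that is 1049 residues followed by the taa stop codon, which agrees with the residue numbering of the final rows.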
1. With regard to the elements of the international application (Replacement sheets which have been furnished to
the receiving Office in response to an invitation under Article 14 are referred to in this report as "originally filed"
and are not annexed to this report since they do not contain amendments (Rules 70.16 and 70.17)):
Description, pages:

1-25 as originally filed
Claims, No.: 

1-18 as received on 21/08/2001 with letter of 20/08/2001 
Claims, pages: 

26,27 as received on 21/08/2001 with letter of 20/08/2001 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

☒ the claims, Nos.: 1-19

considered to go beyond the disclosure as filed (Rule 70.2(c)):

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 



6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement

Novelty (N)                    Yes: Claims 1-13, 15-18
                               No:  Claims 14

Inventive step (IS)            Yes: Claims none
                               No:  Claims 1-18

Industrial applicability (IA)  Yes: Claims 1-18
                               No:  Claims





2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
This IPER is based on the following prior art documents:
D1:

SHIMOJI M ET AL: 'DESIGN OF A NOVEL P450: A FUNCTIONAL BACTERIAL-
HUMAN CYTOCHROME P450 CHIMERA' BIOCHEMISTRY, vol. 37, no. 25, 1998,
pages 8848-8852, XP002913528 ISSN: 0006-2960
D2: 

WO 96 14419 A (BRITISH GAS PLC ;FLITSCH SABINE LAHJA (GB); NICKERSON 
DARREN PAUL) 17 May 1996 (1996-05-17) 

Section V 

Claim 14 of the present application cannot be considered novel, since the features defined
in said claim may be an inherent property of naturally occurring Pseudomonas putida.
Hence, claim 14 does not meet the requirements of Article 33(2) PCT.

The closest prior art to the subject matter of claims 1-9 is regarded as being D2.

The problem to be solved can be seen in the provision of a method to oxidise specific 
halo aromatic compounds on a ring carbon atom. 

This problem could be solved by using the properties of the wild type and of active site 
mutants of P450cam to oxidise such compounds. 

The subject matter of claims 1-9 differs from D2 in that said P450cam and active site
mutants thereof are used in a process for oxidation of 1,2-dichlorobenzene,
1,2,4-trichlorobenzene, 3,3'-dichlorobiphenyl, 2,2',4,5,5'-pentachlorobiphenyl,
pentachlorobenzene or hexachlorobenzene.

However, the definition of new substrates for a known enzyme cannot be considered to 
fulfill the requirements of Article 33 (3) PCT for the following reasons: 

1. 

D2 already discloses a method to oxidise condensed aromatic compounds, such as
naphthalene, phenanthrene, fluoranthene and pyrene where the oxidised carbon is a
ring carbon of the substrate (see for instance Figures 4 c-f).

2.

D2 also suggests a method for oxidising halo aromatic compounds (e.g. diphenyl and
biphenyl compounds and their halogenated variants, see for instance page 4, lines 10-27
and claim 5) using active-site mutants of the P. putida mono-oxygenase
P450cam, where said mutants include among others a Y96A mutant (see e.g. page 6
first paragraph, page 8 last paragraph, claim 7 and Fig. 1). D2 also teaches how to
modify the active site of the wild type enzyme in order to decrease its specificity
towards camphor, creating a desired "aromatic pocket" (see e.g. pages 3 and 4) and
that the amino acid which replaces Y96 is conveniently a small hydrophobic amino acid
(for instance a non-polar one) like Ala, Gly, Val, Leu or Ile.

Thus, in view of D2 the elucidation of further mono-oxygenase substrates chemically 
closely related to known ones is obvious to somebody skilled in the art. 

In view of D2 the subject matter of claims 10, 11, 13 and 18 is obvious to the skilled
artisan, contrary to the requirements of Article 33(3) PCT.

The subject matter of claim 12 of the present international application cannot be considered
to fulfill the requirements of Article 33(3) PCT, since expression of known exogenous
genes in prokaryotic/eukaryotic cells with the aim of conducting a biochemical process in
said cells is obvious to the person skilled in the art.

Claim 15 does not fulfill the requirements of Article 33(3) PCT, since the use of known
genes (see e.g. D2, pages 3-6) to create transgenic animals or plants can be seen
merely as a routine method.

Document D1 suggests that modified P450 enzymes can be used for bioremediation of
areas polluted with halo aromatic compounds (see e.g. abstract and page 8852).
Hence, the subject matter of claim 16 does not meet the requirements of Article 33(3) PCT.

Claim 17 does not fulfill the requirements of Article 33(3) PCT since a process for the
selection of mutants of a known enzyme with well documented and chemically related
substrates is seen as being a routine method for somebody skilled in the art.

Section VIII 
1)

The numbering of claim 19 of the new set of claims filed with the letter of 20 August 2001 is
considered a typing error and is referred to in this report as claim 18.

2)

The applicant's attention is drawn to the fact that, in the absence of a reference point, terms
like "less polar" and "homologue" are meaningless. Thus, claims containing at least one
of the above mentioned expressions are unclear and do not meet the requirements
of Article 6 PCT.

3)

It is highly questionable whether any fragment of the putidaredoxin reductase and/or
putidaredoxin is actually suitable for the process according to claim 11, since in its
broadest meaning the term "fragment" covers individual amino acids (Articles 5 and 6
PCT).
CLAIMS 

1. Process for oxidising a halo aromatic substrate which has more than one 
halogen atom, which process comprises oxidising said substrate with a monooxygenase 
enzyme, wherein a ring carbon of the substrate is oxidised. 

2. Process according to claim 1 in which the enzyme comprises a substitution 
of an amino acid in the active site by an amino acid with a less polar side-chain. 

3. Process according to claim 2 in which the enzyme comprises one or more 
other amino acid substitutions in the active site. 

4. Process according to any one of the preceding claims in which the enzyme is:

(i) P450cam, or

(ii) a naturally occurring homologue of (i), or

(iii) a mutant of (i) or (ii).

5. Process according to claim 4 in which the enzyme is one in which amino
acid 96 of P450cam or the equivalent amino acid in a homologue, has been changed to an
amino acid with a less polar side-chain.

6. Process according to any one of the preceding claims in which the halogen is 
chlorine. 

7. Process according to any one of the preceding claims in which the aromatic 
compound is a benzene or biphenyl. 

8. Process for oxidising a halo aromatic substrate, which process comprises
oxidising said substrate with an enzyme as defined in any one of claims 1 to 5, wherein the
substrate is 1,2-dichlorobenzene, 1,2,4-trichlorobenzene, 3,3'-dichlorobiphenyl or
2,2',4,5,5'-pentachlorobiphenyl.

9. Process for oxidising a halo aromatic substrate, which process comprises
oxidising said substrate with an enzyme as defined in any one of claims 1 to 5, wherein the
substrate is pentachlorobenzene or hexachlorobenzene.

10. Process according to any one of the preceding claims which is carried out in 
a cell that expresses: 

(a) an enzyme as defined in any one of claims 1 to 5;

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin.

11. Process according to claim 10 in which:

(b) is putidaredoxin reductase or a homologue; or a fragment thereof; and/or

(c) is putidaredoxin or a homologue; or a fragment thereof.

12. Process according to claim 10 or 11 wherein the cell is one in which the 
enzyme (a) does not naturally occur. 

13. Process according to any one of claims 10 to 12 wherein the cell is
one which in its naturally occurring form is able to oxidise a substrate as defined in any
one of claims 6 to 9.

14. A cell as defined in claim 13. 

15. A transgenic animal or plant whose cells are as defined in any one of 
claims 10 to 13. 

16. Method of treating a locus contaminated with a substrate as defined
in any one of claims 1 or 6 to 9 comprising contacting the locus with an enzyme as defined
in any one of claims 1 to 5 or a cell as defined in any of claims 10 to 12, or an animal or
plant as defined in claim 15.

17. Process for selecting a mutant of an enzyme as defined in claim 1, 4(i) or 
4(ii) for its ability to oxidise a substrate as defined in claim 1, or any one of the claims 6 to 
9, which process comprises screening a library of said mutants for their oxidation effect on 
the substrate. 

18. Process, cell, animal, plant or method according to any one of claims 1 to 16
wherein the enzyme is one that has been selected in a process according to claim 17.